A Preference Ranking Model Using a Discriminatively-trained Classifier
نویسندگان
چکیده
This paper presents an ordinal regression approach to the query-by-description problem. Instead of returning a single classification, such as genre, or a list of the top N songs assumed to be relevant, this algorithm mirrors choices similar to "like", "skip", "play", and "hate" buttons seen on commercial Internet radio stations. Ordinal regression can be viewed as a hybrid between multi-class classification and regression. While ordinal regression defines a small number of discrete classes similar to multi-class classification, there is a dimension of preference which determines the order of classes, which is similar to regression. To model a given song, a transcription of acoustic segment models is derived from an unsupervised approach using hidden Markov models. These transcriptions serve as text documents for latent semantic analysis. A discriminativelytrained ordinal regression algorithm provides the ranks for a given song. Test queries and ground-truth labels are developed using textual musical descriptions from songs to combat the problem that listeners do not base musical similarity solely on acoustic perception. Initial results on a small set of queries indicate the proposed ordinal regression approach performs better than a common perceptron-based ordinal regression algorithm. Further, the results using acoustic segment models is shown to be superior compared to a typical bag-of-frames algorithm using Gaussian mixture models. Extensions to real playlists and user preference indicators are currently in development.
منابع مشابه
Discriminatively trained dependency language modeling for conversational speech recognition
We present a discriminatively trained dependency parserbased language model. The model operates on utterances, rather than words, and so can utilize long-distance structural features of each sentence. We train the model discriminatively on n-best lists, using the perceptron algorithm to tune the model weights. Our features include standard n-gram style features, long-distance co-occurrence feat...
متن کاملTo Track or To Detect? An Ensemble Framework for Optimal Selection
This paper presents a novel approach for multi-target tracking using an ensemble framework that optimally chooses target tracking results from that of independent trackers and a detector at each time step. The ensemble model is designed to select the best candidate scored by a function integrating detection confidence, appearance affinity, and smoothness constraints imposed using geometry and m...
متن کاملAdapting Document Ranking to Users' Preferences Using Click-Through Data
1* Min Zhao is currently researcher at NEC Lab China, Beijing. Abstract. This paper proposes a new approach to ranking the documents retrieved by a search engine using click-through data. The goal is to make the final ranked list of documents accurately represent users’ preferences reflected in the click-through data. Our approach combines the ranking result of a traditional IR algorithm (BM25)...
متن کاملData Sampling and Dimensionality Reduction Approaches for Reranking ASR Outputs Using Discriminative Language Models
This paper investigates various approaches to data sampling and dimensionality reduction for discriminative language models (DLM). Being a feature based language modeling approach, the aim of DLM is to rerank the ASR output with discriminatively trained feature parameters. Using a Turkish morphology based feature set, we examine the use of online Principal Component Analysis (PCA) as a dimensio...
متن کاملBroad phonetic classification using discriminative Bayesian networks
We present an approach to broad phonetic classification, defined as mapping acoustic speech frames into broad (or clustered) phonetic categories. Our categories consist of silence, general voiced, general unvoiced, mixed sounds, voiced closure, and plosive release, and are sufficiently rich to allow accurate time-scaling of speech signals to improve their intelligibility in, e.g. voice-mail app...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008